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FIG. 2 
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FIELD OF THE INVENTION 
The present invention relates to broadcast systems in general, and in 
particular to systems for collecting user feedback in a broadcast system, especially 
in a data broadcast system where feedback may be used for determining the 
5 content of future broadcasts. 

BACKGROUND OF THE INVENTION 
Data broadcasting systems have been proposed as a good solution 
for delivering data to customers while avoiding known problems associated with 

10 the Internet. In a typical data broadcasting system a single broadcast entity 
broadcasts information to a multiplicity of users, each user typically associated 
with a personal computer, a mobile computer, an interactive television, a hand- 
held communication device such as a beeper or a cellular or mobile telephone, or a 
similar device. Each user may receive those broadcast items which the user wishes 

15 to receive. Typically but not always, each broadcast item comprises a multimedia 
item. 

It is generally recognized that it would be desirable to obtain user 
feedback at the broadcast entity, the user feedback typically comprising 
information about types of information which each user would like to receive. 
20 However, because of the multiplicity of users it would apparently be inefficient 
and awkward to receive individual feedback from each user. 

One system for scheduling broadcasts using customer profiles is 
described in US Patent 5,758,257 to Herz et al. The Herz et al patent describes 
scheduling the receipt of desired movies or other forms of data by means of 
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individual customer profiles describing each individual customer. A so-called 
"agreement matrix" is calculated by comparing the recipient's profiles to the actual 
profiles of the available programs or other data. A virtual channel for each 
individual is generated from the "agreement matrix", in an attempt to satisfy the 
5 desires of each individual via their own virtual channel. 

PCT patent application PCT/IL98/00307 describes an electronic 
program guide system using an intelligent agent in which the electronic program 
guide may be customized based on user behavior. 

The following references provide a sample of the state of the art, and 
10 are useful in understanding the present invention: 

1. S. Acharya, M. Franklin and S. Zdonik. 1996. Disseminating 
updates on broadcast disks. Proceedings of the 22nd VLDB Conference, Mumbai 
(Bombay), India. 

2. S. AchaTya, R. Alonso, M. J. Franklin, and S. B. Zdonik. 1995. 
15 Broadcast disks: Data management for asymmetric communications environments. 

In M. J. Carey and D. A. Schneider, editors, Proceedings of the 1995 ACM 
SIGMOD International Conference on Management of Data, pages 199-210, San 
Jose, California, 1995. SIGMOD Record 24(2). 

3. W. P. Birmingham, E. H. Durfee, T. Mullen, and M. P. Wellman. 
20 1995. The distributed agent architecture of the University of Michigan Digital 

Library (UMDL). In Spring Symposium Series on Software Agent 

4. E. Bloedom, I. Mani and T. R. MacMillan. 1996. 
Representational issues in machine learning of user profiles. AAAI Spring 
Symposium on Machine Learning in Information Access, 
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5. E. W. Brown, J. P. Callan, B. Croft. 1994. Fast incremental 
indexing for full-text information retrieval. Proceedings of VLDB Conf pages 192- 
202. 

6. cc:Browser. 1997. http://www.faulkner.com/ 

5 7. S. D. Cohen and L. J. Guibas. 1996. Shape-based illustration 

indexing and retrieval some first steps Stanford University, Stanford, ARPA Image 
Understanding Workshopl996. 

8. CNN++ News filtering Agent, http://www.cs.umbc.edu/~smart- 

ht/info.html 

10 9. D. Cutting and J. Pederson. 1990. Optimizations for dynamic 

inverted index maintenance. Proceedings of SIGIR, pages 405-41 1. 

10. Y. Ejgenberg and Y. Lindel. 1997. B.S.c project, Computer 
Science Department at Bar Ilan University. 

11. P. Edwards, D. Bayer, C. L. Green and T. R. Payne. 1996. 
15 Experience with learning agents which manage Internet-based information. AAAI 

Spring Symposium on Machine learning in IA 9 Scotland. 

12. C. Faloutsos and D. W. Oard. 1995. A survey of information 
retrieval and filtering methods. University of Maryland CS-TR-3514. 

13. W. B. Frakes and R. Baeza- Yates. 1992. Information Retrieval: 
20 Data Structures & Algorithms. Prentice Hall Englewood Cliffs, New Jersey. 

14. K. C Gowda and G. Krishna. 1978. Agglomerative clustering 
using the concept of mutual nearest neighbourhood. Pattern Recognition ,Vol. 10, 

No 2, pp. 105-112 . 

15. S. Gam. ^WEbFilter: An agent for filtering the Web. 



3 



http://ils.unc.edu/webfilter/ 

16. D. E. Goldberg. 1989. Genetic Algorithm in Search, 
Optimization and Machine Learning. Addison- Wesley. 

17. T. Imielinski , S. Viswanathan. 1994. Adaptive wireless 
5 information systems. In Proceedings of GIGDBS Conference, Tokyo, Japan. 

18. I. Ink and N. McGough. 1992. Robots and mail filtering 
http://www.ii.com/internet/robots/ 

19. A. Kobsa. 1994. User modeling and user-adapted interaction. In 
Proceedings of ACM CHV94 Conference on Human Factors in Computing 

10 Systems, Volume 2 of TUTORIALS, pages 415-416. 

20. K. Lang. 1995. News Weeder: Learning to filter netnews. 
Proceedings of INT Conference of Machine Learning, PP. 331-339. 

21. W. Y. Ma and B. S. Manjunath. 1995. Image indexing using a 
texture dictionary. Proceedings of SPIE Conference on Image Storage and 

15 Archiving System ,Vol. 2606, pp. 288-298. Philadelphia Pennsylvania. 

22. P. Maes and R. Kozierok. 1993. Learning interface agents. 
Proceedings of AAAI-93 Washington DC pp. 459-465. 

23. B. S. Manjunath and W. Y. Ma. 1996. Texture features for 
browsing and retrieval of image data. IEEE T-PAMI special issue on Digital 

20 Libraries. 

24. A. Moukas. 1996. Amalthaea: Information discovery and 
filtering using a multi-agent evolving ecosystem. The first international conference 
on the Practical Application of Intelligent Agents and Multi Agents Technology. 
pp. 421-436. 



I 



25. S. J. Pelletier and J. F. Arcand. 1996. STEALTH: A personal 
digital assistant for information filtering. The first international conference on the 
Practical Application of Intelligent Agents and Multi Agents Technology, pp. 455- 
474. 

5 26. E. Rasmussen. 1992. Information Retrieval Data Structures and 

Algorithms. Editors: W. B. Frakes and R. Baeza- Yates Prentice Hall Inc., 
Englewood Cliffs, N. J. 

27. P. Resnick, N. Iacovou, M. Suchak, P. Bergstrom and J. Riedl. 
1994. GroupLens: An open architecture for collaborative filtering of netnews. 

10 Proceeding of ACM CSCW 94 Conference on Computer Supporter Cooperative 
Work. pp. 175-186. 

28. C. J. Van Rijsbergen. 1979. Information Retrieval-Second 
Edition. Butterworth & Co (Publisher) LTD. 

29. G. Salton. and M.J. McGill. 1983. Introduction to Modern 
15 Information Retrieval McGraw-Hill. 

30. U. Shardanand and P. Maes. 1995. Social information filtering: 
Algorithms for automating "Word of Mouth". ACM CHI' 9 5 MOSAIC OF 
CREATIVITY, pages 210-217. 

31. B. D. Sheth, 1994. A Learning Approach to Personalized 
20 Information Filtering. Master's thesis, MIT Media Lab. 

32. Study Group on Broadcasting System in the Multimedia Age 
(SGBSMA). 1995. Report by the Study Group on Broadcasting System in the 
Multimedia Age. Ministry of Posts and Telecommunication (MPT) of Japan, Press 
Release. Http://www.mpt.go.jp/Report/B^ 
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33. L. Terveen, W. Hill, B. Amento, D. MacDonald, and J. Creter. 
1997. PHOAKS: A system for Sharing recommendations. Communication of the 
ACM, vol.40, no. 3, pages 59-62. 

34. Universal Browser for Listening. 1997. 
5 http : //www. austinre . com/cgi/browser.cgi . 

35. J. Vassileva. 1994. A practical architecture for user modeling 
in a hypermedia-based information system. In Proceedings of the 4 th International 
Conference on user Modeling, pages 1 15-120. MITRE Corporation. 

36. J. M. Vidal and E. Durfee. 1995. Task planing agent in the 
10 UMDL. In Proceedings of the 1995 Intelligent Information Agents Workshop. 

37. Video On Line. 1995. http://www.vol.it/VOLB/browser.html. 

38. W. Wahlster. 1994. Toward personalized multimedia. In 
3Proceedings of the 4 th International Conference on User Modeling, pages 3-4. 
MITRE Corporation. 

15 39. D. Walley. 1996a. Data broadcasting vs. conventional on-line 

services. Http://www.mbnet.mb.ca/-walleydc/advanta.html. 

40. D. Walley. 1996b. Free commercial mass-audience datacasting 
of popular information and software direct to personal computers. 

4 1 . Http://ww.mbnet.mb.ca/'-walleydc/int_intr.html. 

20 42. A. White and R. Jain. 1996. Algorithm and strategies for 

similarity retrieval. University of California, San Diego. 

43. Bruce Krulwich; 1997; Lifestyle Finder, Intelligent user 
profiling using large scale demographic data. 

44. G. Paliouras, C. Papatheodorou, V. Karkaletsis, C. Spyropoulos, 
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and V. Malaveta; "Learning user communities for improving the services of 
information providers". 

The disclosures of all references mentioned above and throughout 
the present specification are hereby incorporated herein by reference. 
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SUMMARY OF THE INVENTION 
The present invention seeks to provide an improved system for 
providing feedback in a broadcast system, particularly in a data broadcast system. 
In the present invention a user profile subsystem, preferably comprising a 

5 hierarchy of cooperating agents, is used to collect and aggregate user feedback for 
delivery to a broadcaster. Preferably, the broadcaster uses the user feedback for 
determining the content of future broadcasts. 

There is thus provided in accordance with a preferred embodiment 
of the present invention a system for collecting user feedback in a data 

10 broadcasting system, the system for collecting user feedback including a 
multiplicity of user profile agents, each user profile agent being associated with 
one of a multiplicity of users of the data broadcasting system and being operative 
to create a user profile based on activity of the one user, a user profile subsystem 
associated with a plurality of user profile agents chosen from among the 

15 multiplicity of user profile agents and operative to derive an integrated user profile 
based on the plurality of user profiles created by the plurality of user profile 
agents, and a broadcasting agent operatively associated with a broadcast center of 
the data broadcasting system and in operative communication with the user profile 
subsystem and receiving therefrom the integrated user profile. 

20 Further in accordance with a preferred embodiment of the present 

invention the user profile subsystem includes a plurality of clustering agents, each 
clustering agent being operatively associated with a subset of user profile agents 
chosen from among the plurality of user profile agents and being operative to 
derive a preliminary community profile based on the plurality of user profiles 

8 
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created by the subset of user profile agents. 

Still further in accordance with a preferred embodiment of the 
present invention the system also includes at least one union set agent in operative 
communication with at least two of the plurality of clustering agents and operative 
5 to coordinate the operation of the at least two clustering agents. 

Additionally in~ accordance with a preferred embodiment of the 
present invention the union set agent is operative to coordinate the operation of the 
at least two clustering agents by coordinating at least one characteristic of the 
preliminary community profile derived by the clustering agents. 

10 Moreover in accordance with a preferred embodiment of the present 

invention the system also includes a community profile agent operative to derive a 
community profile from the preliminary community profile. 

Further in accordance with a preferred embodiment of the present 
invention the integrated user profile includes at least one community profile. 

15 There is also provided in accordance with another preferred 

embodiment of the present invention a method for collecting user feedback in a 
data broadcasting system, the method including providing a multiplicity of user 
profile agents, each user profile agent being associated with one of a multiplicity 
of users of the data broadcasting system, creating, using a plurality of user profile 

20 agents from among the multiplicity of user profile agents, a plurality of user 
profiles based on activity of a plurality of users, deriving, using a user profile 
subsystem associated with a plurality of user profile agents chosen from among the 
multiplicity of user profile agents, an integrated user profile based on the plurality 
of user profiles created by the plurality of user profile agents, and utilizing a 
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broadcasting agent operatively associated with a broadcast center of the data 
broadcasting system and in operative communication with the user profile 
subsystem and receiving therefrom the integrated user profile. 

Further in accordance with a preferred embodiment of the present 
invention the user profile subsystem includes a plurality of clustering agents, each 
clustering agent being operatively associated with a subset of user profile agents 
chosen from among the plurality of user profile agents, and the deriving step 
includes deriving the integrated user profile based on the plurality of user profiles 
created by the subset of user profile agents. 

Still further in accordance with a preferred embodiment of the 
present invention the deriving step also includes coordinating the operation of at 
least two of the plurality of clustering agents using at least one union set agent in 
operative communication with at least two of the plurality of clustering agents. 

Additionally in accordance with a preferred embodiment of the 
present invention the method also includes the union set agent coordinating the 
operation of the at least two clustering agents by coordinating at least one 
characteristic of a preliminary community profile derived by the clustering agents. 

Moreover in accordance with a preferred embodiment of the present 
invention the integrated user profile includes at least one community profile. 

Further in accordance with a preferred embodiment of the present 
invention the deriving step includes a community profile agent deriving the at least 
one community profile from a preliminary community profile. 

Still further in accordance with a preferred embodiment of the 
present invention the utilizing step includes modifying at least one broadcast 
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program based, at least in part, on the integrated user profile. 

Additionally in accordance with a preferred embodiment of the 
present invention the modifying includes assigning, to a program associated with a 
first community, items scheduled to be broadcast according to a program 
5 associated with a second community. 

Moreover in .accordance with a preferred embodiment of the present 
invention the assigning includes assigning based on a measure of relevancy to the 
first community. 

There is also provided in accordance with another preferred 
10 embodiment of the present invention a method for collecting user feedback in a 
data broadcasting system, the method including deriving a user profile for each of 
a multiplicity of data item users, clustering user profile information from at least 
some of the multiplicity of data item users to produce preliminary community 
information, modifying the preliminary community information to produce 
15 community information, and ufilizing the community information for modifying a 
broadcast schedule. 
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BRIEF DESCRIPTION OF THE DRAWINGS 
The present invention will be understood and appreciated more fully 
from the following detailed description, taken in conjunction with the drawings in 
which: 

Fig. 1 is a simplified pictorial illustration of an agent-based user 
feedback collection system constructed and operative in accordance with a 
preferred embodiment of the present invention; 

Fig. 2 is a simplified flowchart illustration of a preferred method of 
operation of the system of Fig. 1; 

Fig. 3 is a simplified pictorial illustration of a preferred embodiment 
of a user profile, useful in understanding the method of Fig. 2; 

Fig. 4 is a simplified pictorial representation of a multimedia 
document representation, useful in understanding the method of Fig. 2; and 

Fig. 5 is a simplified flowchart illustration of a preferred method of 
operation of a portion of step 200 of Fig. 2. 
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DETAILED DESCRIPTION OF A PREFERRED EMBODIMENT 

Reference is now made to Fig. 1 which is a simplified pictorial 
illustration of an agent-based user feedback collection system constructed and 
operative in accordance with a preferred embodiment of the present invention. 

5 The system of Fig. 1 comprises a plurality of user stations 100, each of the 
plurality of user stations 100 being operatively associated with a user profile agent 
110. It is appreciated that, typically, a multiplicity of user stations 100, such as, 
for example, 1 million or more user stations 100, is provided, a plurality of user 
stations 100 being shown in Fig. 1 for sake of simplicity of description. It is also 

10 appreciated, as also described below, that other agents (not shown) in addition to 
the user profile agent 110, may also be operatively associated with each of the 
plurality of user stations 100. 

Each of the user stations 100 preferably comprises an appropriate 
computing system, such as a multimedia computing system as is well-known in the 

15 art, equipped and operative to receive broadcasts, typically comprising multimedia 
data broadcasts, from a broadcast center 120. It is appreciated, as is well known in 
the art, that the broadcast center 120 may broadcast using any one or more of a 
variety of appropriate broadcast methods, including conventional terrestrial 
broadcast, digital terrestrial broadcast, satellite broadcast, cable broadcast, or any 

20 other appropriate broadcast method, It is further appreciated that each user station 
100 is preferably equipped to receive broadcasts of a type broadcast by the 
broadcast center 120. 

Each of the user profile agents 110 is preferably implemented in a 
combination of computing hardware and software, as is well known in the art, and 
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is preferably operative to observe the behavior of a user of the associated user 
station 100 and to create a user profile, as described more fully below, 
characterizing the preferences of the user of the associated user station 100. 

The system of Fig. 1 also comprises a user profile subsystem 130. 
5 The user profile subsystem 130 is preferably implemented in an appropriate 
combination of computing hardware and software, as is well known in the art, and 
is typically located remotely from each of the user stations 100, and in operative 
communication therewith. The user profile subsystem 130 is operative to receive a 
user profile from each of the user profile agents 110, to analyze the received 
10 profiles, and to derive therefrom a plurality of community profiles characterizing 
the multiplicity of user stations 100 in accordance with a plurality of communities, 
each community preferably sharing a set of preferences and characteristics. 
Typically, the user profile subsystem 130 produces an output comprising 
community information, preferably in the form of a plurality of community 
15 profiles each characterizing one community. 

The user profile subsystem 130 typically comprises a plurality of 
agents, as described below. The plurality of agents comprised within the user 
profile subsystem 130 may, when referred to generally, be referred to as 
"integrated user profile agents". For simplicity of description, the integrated user 
20 profile agents are collectively referred to herein as the "user profile subsystem 
130", it being appreciated that, because of possible separate implementation and 
possible geographic dispersal of the integrated user profile agents, the term "user 
profile subsystem 130" is itself a term which has been adopted mainly for the sake 
of simplicity of description. 
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The user profile subsystem 130 typically comprises a plurality of 
clustering agents 140, each clustering agent 140 being operatively associated with 
a subset of the user stations 100 and preferably being operative to produce one or 
more preliminary community profiles based on information received from the 
associated user profile agents 1 10 associated with each of the user stations 100 in 
the subset of the user stations 100. 

The user profile subsystem 130 also typically comprises at least one 
union set agent 150. The union set agent 150 is preferably operative to cooperate 
with a plurality of clustering agents 140 and to adjust the operation thereof in such 
a way that community information produced by each clustering agent 140 is 
compatible with, and tends to form the same communities as, that produced by 
each other clustering agent 140. Specifically, and without limiting the generality 
of the foregoing, the union set agent 150 is preferably operative to examine 
community profiles produced by a plurality of clustering agents 140, to identify 
similar community profiles produced by different clustering agents 140, to unify 
the similar community profiles, and to report the unified profiles back to each of 
the different clustering agents 140 so that the different clustering agents 140 will 
operate with the same community profiles. 

The system of Fig. 1 also comprises at least one community agent 
155, typically comprising a plurality of community agents 155. Each community 
agent 155 is preferably associated with one community profile produced by one or 
more clustering agents 140 as described above, and is preferably operative to 
aggregate information relating to the one community profile as reported by the one 
or more clustering agents 140. 



The system of Fig. 1 also comprises a broadcasting agent 160, which 
is typically implemented in an appropriate combination of computer hardware and 
software. The broadcasting agent 160 preferably receives the community 
information produced by the user profile subsystem 130 and is preferably 

5 operative, based on the received community information, to modify the operation 
of the broadcast center 120 to take into account community preference information 
comprised in the community information. By way of example only and without 
limiting the generality of the foregoing, modifying the operation of the broadcast 
center 120 may include modifying the content of future broadcasts. 

10 The operation of the apparatus of Fig. 1 is now briefly described. 

The broadcast center 120 broadcasts a plurality of broadcast items to the 
multiplicity of user units 100. The user of each user unit 100 elects to receive one 
or more broadcast items; it is appreciated that, in some cases, a user may elect not 
to receive any items. It is appreciated, as is well known in the art, that a user may 

15 elect to receive a broadcast item using any appropriate technique, such as, for 
example: the user may select an individual item or a group of items in advance; 
the user may select an individual item or a group of items on demand; the user may 
program the user unit 100 to select a particular item, an item matching certain 
criteria, or an item broadcast at a particular time; or any appropriate combination 

20 of the above, including a combination with other techniques well known in the art. 
It is also appreciated that a filter agent (not shown) may automatically download, 
accept, or record only those items which are deemed to be of interest to the user 
based on criteria determined by the filter agent, by other agents, or otherwise, 
particularly as described below. 
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Each user profile agent 110 is preferably operative, as described 
more fully below, to create a user profile describing the item-selection preferences 
of the user of the associated user unit 100. Each user profile agent 1 10 then passes 
on the user profile to the user profile subsystem 130, specifically to one of the 

5 clustering agents 140. Each clustering agent 140 is preferably operative, as 
described more fully below, to cluster information from a plurality of user profiles 
and to create therefrom community profile information, representing a clustered 
aggregate of information about preferences of a plurality of users. 

Preferably, the union set agent 150 is operative, as described more 

10 fully below, to coordinate the activities of the clustering agents 140 in order to 
assure that uniform community profile information, representing clustered 
information across all of the user units 100, is produced. Community information 
from a plurality of clustering agents 140 is then typically aggregated into a 
community profile for each community by one or more community agents 155, 

15 each community agent 155 preferably being associated with one community 
profile. The community information, typically in the form of the community 
profiles, is then passed on the broadcasting agent 160, where it is used as input in 
broadcast decisions of the broadcast center 120, including decisions as to the 
content of future broadcasts. 

20 Reference is now ma de to Fig. 2, which is a simplified flowchart 

illustration of a preferred method of operation of the system of Fig. 1 . The method 
of Fig. 2 preferably comprises the following steps: 

A user profile is derived for each user of a multiplicity of data item 
users (step 170). It is appreciated that a wide variety of different methods, 



including methods known in the art, may be used to derive a user profile, and that 
a user profile may have any one of a wide variety of structures, including 
structures which are well known in the art. It is further appreciated that not every 
user need have exactly the same user profile structure as every other user. 

5 Reference is now additionally made to Fig. 3, which is a simplified 

pictorial illustration of a preferred embodiment of a user profile, useful in 
understanding the method of Fig. 2. In the example user profile of Fig. 3, a profile 
is depicted as a tree, in which each level represents a different level of information 
about user preferences, each node in the tree being represented as a vector, the sum 

10 of whose component values is 1.0. Reference is now additionally made to Fig. 4, 
which is a simplified pictorial representation of a multimedia document 
representation, useful in understanding the method of Fig. 2. It is appreciated that 
the example document representation of Fig. 4 is generally compatible with the 
example user profile of Fig. 3. 

15 Referring back to Fig. 2, user profile information from at least some 

of the multiplicity of data item users is clustered together to produce preliminary 
community information (step 180). The clustering in step 180 may use any 
appropriate clustering method for combining together similar profiles. Preferably, 
an appropriate clustering method may be chosen from the field of information 

20 retrieval technology, with the additional feature that more than one level of user 
profile, such as more than one level of the example user profile of Fig. 3, is used 
for clustering. 

Clustering of user profiles preferably is based on similarity between 
user profiles. Similarity between two user profiles, based on user classes and 
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subclasses, may preferably be computed in accordance with the following formula: 

similarity(userl,user2) = 

X X userl _ classvuserl _ subclassij xuser2 _ classiuser2 _ subclassij 
CSi 

JX X (userl _ classi userl _ subclassij)^ • X X (user2 _ classi-userl _ subclassij) 
CSi CSi 

where userl and user2 are class vectors of two user profiles, each user profile 
having a plurality of subclasses, C being the number of classes and Si the 
corresponding number of subclasses. It is appreciated that the above formula 
comprises one example of a suitable formula, and that other formulas may be used. 

Preferably, clustering is carried out using an appropriate 
agglomerative clustering method, as is well known in the art. One example of an 
appropriate method for clustering is as follows: 

0. Identify each user profile as a point in the space to be merged. 

1 . Initialize each point as a cluster. 

2. Find the two closest clusters in the space and merge them 

together. 

3. Compute the distance from the newly formed cluster to all other 

clusters. 

4. If more than one cluster remains, return to step 2. 

Typically, a method such as that described above outputs a complete 
binary tree wherein each internal node of the tree represents a merge between two 
cluster and the leaves are the points in the space, or user profiles. Typically, the 
lower levels in the tree are small tight clusters while the higher levels are larger, 
general clusters. The indicated method, which operates on the available data and 
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is not limited by external parameters, is considered, based on the work of the 
inventors of the present invention, to be preferred in the present invention. 

In the method described above, a measure of closeness or distance is 
important in determining which clusters to merge. A number of ways of 
5 determining distance are well known in the art. In the present invention, it is 
believed to be preferred to use a "complete link" method, in which the distance 
between two clusters is the distance between their two furthest points. This 
method is believed likely to produce bound clusters, as opposed to long, "straggly" 
clusters. 

10 An approach to expressing the method described above is to use the 

Lance Williams dissimilarity update formula, which is well-known in the art. The 
formula enables incremental recalculation of cluster distances. The Lance- 
Williams dissimilarity update formula is given as follows: 

d(ci, j,ck) = cci * d(ci,ck) + aj • d(cj,ck) + p • d(ci,cj) + y|d(ci,ck) - d(cj,ck)| 
15 where, for the preferred case of complete link, the following parameters are 

preferably used: 

. 1 
ai = — 
2 

p=o 

1 

Y = 2 

20 The preliminary community information is modified to produce 

community information (step 190). Referring back to Fig. 1, it is appreciated that 
each clustering agent 1 40 is operative to produce preliminary community 
information relating to a plurality of associated user stations 100, which plurality 

20 
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of associated user stations 100 is generally disjoint from each other plurality of 
user stations 100 associated with another clustering agent 140. For reasons of 
efficiency, it is desirable nonetheless for the preliminary community information 
produced by each clustering agent 140 to be modified and made uniform with 
community information produced by other clustering agents 140. Preferably, the 
union set agent 150 is primarily operative to carry out step 190, with final 
agglomeration at the community level being carried out by the community agent 
1 55, as described above. 

Referring back to Fig. 2, in step 190, preferably preliminary 
community information, preferably in the form of preliminary community profiles 
produced by a plurality of clustering agents, is unified preferably using methods 
similar to those described above for user profiles in step 180. Preferably, as 
described above with reference to Fig. 1, the union set agent sends information 
back to the clustering agents, the information comprising updated community 
profile information which replaces the preliminary community profile information. 
In this way, as described above with reference to Fig. 1, different clustering agents 
will produce information agglomerated into the same community profiles. In this 
way, since each community agent is associated with one community profile, the 
union set agents preferably dSermine the number of community profiles and the 
number of community agents. 

The community information is then utilized for modifying a 
broadcast schedule (step 200). Typically, the broadcasting center has a list of 
items to broadcast, the list being made available to a broadcasting agent. It is 
appreciated that each item, sometimes referred to herein as a "document", may 
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comprise any article or item typically broadcast by a broadcaster, such as, for 
example, a document, an image, a video, a computer program, a television 
program, an update to any of the previously listed items, or any other appropriate 
article or item. The agent attempts to prepare, based on document information and 

5 on community profiles, a broadcasting program which will maximize the 
satisfaction of the user community. Preferably, this task comprises extracting a 
multimedia document representation based on information that accompanies the 
multimedia document; and scoring all documents and preparing a broadcast 
program for each community based on the results of the scoring. 

10 The broadcasting agent preferably determines the similarity between 

the community profiles and the document collection. Different communities rate 
the documents differently. Therefore, for each document in the document 
collection, the broadcasting agent prepares a vector of scores with respect to each 
community profile. In other words, for each document we have a vector of scores 

15 where score st is the score that the document received by community /. A 
preferable function that the broadcasting agent may use for scoring the documents 
is as follows: 



Scorecommunity_ profile, document = 
Community_Class_Weightc x 
Community_Subclass_ Weights x 




22 



where: 

Community_Class_Weightc is the weight of class c according to the 
community profile; 

Community_Subclass_Weights is the weight of subclass s of class c 
5 of the community profile; 

Com_prof_descd is the description field's weight of the community 

profile; 

D is the number of description fields in the description vector of the 

profile; and 

10 K is the number of terms in the term vector. 

A broadcast program is then chosen for each community. Reference 
is now additionally made to Fig. 5, which is a simplified flowchart illustration of a 
preferred method of operation of a portion of step 200 of Fig. 2. The method of 
Fig. 5 comprises preferred method for choosing a broadcast program for each 

15 community. The method of Fig. 5 is self explanatory, except as described below. 

Preferably, after an initial community program is prepared for each 
community, an attempt is made to extend the programs to be broadcast by scanning 
the list of documents selected to be broadcast for all users and assigning, to each 
community, additional relevant documents that were selected to be broadcast but 

20 were not assigned to that community program. In this context, a relevant 
document for a community is preferably defined as a document whose score for 
that community exceeds a relevancy threshold, the relevancy threshold typically 
comprising a parameter pre-defined by an operator of the system. 

It is appreciated that various features of the invention which are, for 



clarity, described in the contexts of separate embodiments may also be provided in 
combination in a single embodiment. Conversely, various features of the invention 
which are, for brevity, described in the context of a single embodiment may also be 
provided separately or in any suitable subcombination. 
5 It will be appreciated by persons skilled in the art that the present 

invention is not limited by what has been particularly shown and described 
hereinabove. Rather the scope of the invention is defined only by the claims which 
follow: 
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What is claimed is: 

CLAIMS 

1 . A system for collecting user feedback in a data broadcasting system, 
the system for collecting user feedback comprising: 

5 a multiplicity of user profile agents, each user profile agent being 

associated with one of a multiplicity of users of the data broadcasting system and 
being operative to create a user profile based on activity of the one user; 

a user profile subsystem associated with a plurality of user profile 
agents chosen from among the multiplicity of user profile agents and operative to 
10 derive an integrated user profile based on the plurality of user profiles created by 
the plurality of user profile agents; and 

a broadcasting agent operatively associated with a broadcast center 
of the data broadcasting system and in operative communication with the user 
profile subsystem and receiving therefrom said integrated user profile. 

15 

2. A system according to claim 1 and wherein the user profile 
subsystem comprises a plurality of clustering agents, each clustering agent being 
operatively associated with a subset of user profile agents chosen from among the 
plurality of user profile agents and being operative to derive a preliminary 

20 community profile based on the plurality of user profiles created by the subset of 
user profile agents. .t_ ; - . 

3. A system according to claim 2 and also comprising at least one 
union set agent in operative communication with at least two of said plurality of 
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clustering agents and operative to coordinate the operation of said at least two 
clustering agents. 



4. A system according to claim 3 and wherein said union set agent is 
5 operative to coordinate the operation of said at least two clustering agents by 

coordinating at least one characteristic of the preliminary community profile 
derived by said clustering agents. 

5. A system according to claim 4 and also comprising a community 
10 profile agent operative to derive a community profile from the preliminary 

community profile. 

6. A system according to any of claims 1-5 and wherein said 
integrated user profile comprises at least one community profile. 

15 

7. A method for collecting user feedback in a data broadcasting 
system, the method comprising: 

providing a multiplicity of user profile agents, each user profile 
agent being associated with one of a multiplicity of users of the data broadcasting 
20 system; 

creating, using a plurality of user profile agents from among the 
multiplicity of user profile agents, a plurality of user profiles based on activity of a 
plurality of users; 

deriving, using a user profile subsystem associated with a plurality 
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of user profile agents chosen from among the multiplicity of user profile agents, an 
integrated user profile based on the plurality of user profiles created by the 
plurality of user profile agents; and 

utilizing a broadcasting agent operatively associated with a 
5 broadcast center of the data broadcasting system and in operative communication 
with the user profile subsystem and receiving therefrom said integrated user 
profile. 



8. A method according to claim 7 and wherein the user profile 
10 subsystem comprises a plurality of clustering agents, each clustering agent being 

operatively associated with a subset of user profile agents chosen from among the 
plurality of user profile agents, and 

the deriving step comprises deriving the integrated user profile 
based on the plurality of user profiles created by the subset of user profile agents. 

15 

9. A method according to claim 8 and wherein the deriving step also 
comprises coordinating the operation of at least two of said plurality of clustering 
agents using at least one union set agent in operative communication with at least 
two of said plurality of clustering agents. 

20 

10. A method according to claim 9 and also comprising said union set 
agent coordinating the operation of said at least two clustering agents by 
coordinating at least one characteristic of a preliminary community profile derived 
by said clustering agents. 



11. A method according to any of claims 7-10 and wherein said 

integrated user profile comprises at least one community profile. 

5 12. A method according to claim 11 and wherein the deriving step 

includes a community profile agent deriving said at least one community profile 
from a preliminary community profile. 

13. A method according to any of claims 7-12 and wherein the 
10 utilizing step comprises modifying at least one broadcast program based, at least in 

part, on the integrated user profile. 

14. A method according to claim 13 and wherein said modifying 
comprises assigning, to a program associated with a first community, items 

15 scheduled to be broadcast according to a program associated with a second 
community. 

15. A method according to claim 14 and wherein said assigning 
comprises assigning based on a measure of relevancy to the first community. 

20 

16. A method for collecting user feedback in a data broadcasting 
system, the method comprising: 

deriving a user profile for each of a multiplicity of data item users; 
clustering user profile information from at least some of the 
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multiplicity of data item users to produce preliminary community information; 

modifying the preliminary community information to produce 
community information; and 

utilizing the community information for modifying a broadcast 

5 schedule. 

17. Apparatus according to any of claims 1 - 6 and substantially as 

described hereinabove. 

10 18. Apparatus according to any of claims 1 - 6 and substantially as 

shown in the drawings. 

19. A method according to any of claims 7-16 and substantially as 
described hereinabove. 

15 

20. A method according to any of claims 7-16 and substantially as 
shown in the drawings. 
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